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Abstract. We determine, to within 0(1), the expected minimal position at level n in certain 
branching random walks. The walks under consideration have displacement vector {vi,V2, ■ ■ .) 
where each Vj is the sum of j independent Exponential(l) random variables and the different Vi 
need not be independent. In particular, our analysis appUes to the Poisson-Dirichlet branching ran- 
dom walk and to the Poisson-weighted infinite tree. As a corollary, we also determine the expected 
height of a random recursive tree to within 0(1). 



1. Introduction 

A branching random walk starts from an initial particle, the root, with position 0. The root pro- 
duces some number of children, who are randomly displaced from their parent according to some 
displacement law. Each child in turn produces some number of children, who are displaced from 
the position of their parent according to the same law; and so on. In general, the displacements of 
sibUngs relative to their parent may be dependent, but for distinct particles v and w, the displace- 
ments of the children of v and of the children of w must be independent. When the displacements 
are non-negative, this is often called an age-dependent branching process, and the displacements are 
thought of as "times to birth". 

There is a natural tree associated with a branching random walk, where the vertices correspond 
to particles, and an edge from parent to child is weighted with the child's displacement from its 
parent. More precisely, let T be the Ulam-Harris tree, which has vertex set V = IJ^o 
think of elements of N" as concatenations of n integers, and take N° = {0}), is rooted at 0, and has 
an edge from v to vi for each v and each i € N. We call N" the n'th generation of T, and for 
V = v\ . . .Vn ^ N*^, we say that v has parent p{v) = vi . . . Vn-i and children vi, i G N. (We will 
usually write r„ in place of for readability.) 

Now suppose X = {Xi : z G N) is a random vector, where each Xj G M U {+oo}. We do 
not require that the entries of X are independent of one another - this will be important below. 
Then we form a branching random walk by marking each vertex v G V with an independent copy 
= {XV :ie N) of X. Write T for the pair (T, {X.'" : v e V}); then T is our branching random 
walk. We call X the displacement vector of T. ^ For each v G V and i G N, we regard Xf as 
the displacement from v to vi, and let S{v) = S{v, T) be the sum of the displacements on the path 

from the root to v (formally, iiv = vi...Vn then S{v) = Yl'i=i Xv\^^"'^''\ and this sum is taken 
to be -l-oo if any of its elements are +oo). We say T has finite branching if almost surely all but 
finitely many coordinates of X are equal to +oo. 
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For n G N, let M„ = inf (^(f ) : v G N"). In all situations we consider in this paper, this infimum 
is attained, so Mn is the minimal displacement of any individual in the n'th generation. The minimal 
displacement is one of the most well-studied parameters associated with branching random walks. 
It has been known since the 1970's [Ham74, Kin75, Big76] that under quite general conditions, Mn 
grows asymptotically linearly with lower-order corrections. Recently there have been substantial 
developments in understanding the finer behavior of M„ on two fronts: first, convergence results 
for the lower order corrections [ABR09, AS 10, HS09]; and second, the concentration of M„ about 
its mean (or median) [ABR09, BZ09, CD06]. We refer to these as the global behavior and the 
local behavior of M„, respectively. Under suitable conditions, M„ generally seems to exhibit the 
following behavior: for some constants a G R and /? > 0, median(M„) = an + /31ogn + 0(1), 
and furthermore, Mn/n a almost surely and (M„ — an)/ log n ^ ^ in probability (but not 
almost surely [HS09]). Also, under sufficiently strong moment conditions for the displacements, 
E {exp(7|M„ — E -M„|)} < oo for some 7 > and all n. (In fact, in some cases the upper tail of 
Mn — E Mn is even known to decay doubly-exponentially quickly [BacOO, FKLIO].) 

To date, however, all the results of the kind described in the preceding paragraph that we are aware 
of, require that the branching random walk has finite branching. In this paper we study the global 
behavior of Mn for a class of branching random walks which do not have finite branching. The class 
we consider is rather restricted but nonetheless contains at least two interesting special cases, one 
related to the factorization of random integers, and one related to the analysis of algorithms. Say 
that X has exponential steps if for all i, Xi is distributed as the sum of i independent Exponential 1) 
random variables. The main result of this paper is the following theorem. For short, we denote 

Mn = median(M„) := sup{x : P{M„ < x} < 1/2}. 

Theorem 1.1. IfK has exponential steps, then 

~ n 3 

M„ = - + — logn + 0(l). 

Remark 1: The 0(1) term is uniform over n and over all BRW for which X has exponential steps. 
Remark 2: Independently of the current work, Elie Aidekon [Aidll] has recently proved, for a 
quite general family of random walks (including those considered in this paper), that M„ — Mn 
converges in distribution to a random variable M* , and describes the distribution of M* in terms of 
a functional of the limit of the derivative martingale associated to the branching random walk. 

Using methods from [FKLIO], we can deduce from Theorem 1.1 uniform exponential tails for 
Mn- In the next theorem and at other points throughout the paper, we will use the Vinogradov 
notation f <^ g which means / = 0{g), with subscripts indicating dependence on any parameter, 
e.g. / <Cfc g means the constant impUed by the ^ symbol may depend on k but not on any other 
variable. 

Theorem 1.2. Tjf X has exponential steps, then for any c\ < e, we have 

F{Mn ^Mn-x} <ci e-"i^ {n^l,x^O), 

and for any ci < 1, 

F{Mn ^Mn + x} <C2 e-"^"" {n^l,x^O). 

Again, the above estimates are uniform over all BRW under consideration. Also, Theorem 1.2 
implies that M„ = EM„ + 0(1), and so both Theorems 1.1 and 1.2 hold with M„ replaced by 

EMn. 
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The simplest example of a displacement vector with exponential steps is obtained by taking 
X = {El, El + E2, . . .) where are lid Exponential 1) random variables. In this case T is 

called the Poisson-weighted infinite tree [AS04] and has been used very effectively in probabilistic 
combinatorial optimization. It also arises in the analysis of an important tree-based data structure 
in the following way. Order the elements of T in increasing order of displacement as {u'jjigN, so 
in particular we have wi = 0, 7«2 = 1 £ N^, and either 103 = 2 G or 1(73 = 11 G N^. Now 
for each m let Zm be the subtree of T induced hy wi, . . . , Wm- By the memoryless property of 
the exponential, it follows that the parent of Wm+i is a uniformly random element of - in other 
words, Zjn is a random recursive tree for all m. This connection is well known [Pit94].) 

Zm is also the subtree of T induced by the set of nodes of displacement at most S{wm)- (Also, 
it is straightforwardly shown by induction and the memoryless property of the exponential that the 
families (Zm)meN and {S{wm))meN are independent, but we will not need this.) Let Hm be the 
height of Zm - the largest generation containing a node of Zm- In other words, Hm = max{n : 
Mn ^ S{wm)}, which is the representation that will be useful below. Devroye [Dev87] showed 
that Hm/ log m — > e almost surely and in expectation, and Pittel [Pit94] provided a different proof 
of the almost sure convergence. As a straightforward consequence of Theorems 1.1 and 1.2, we 
obtain the following more precise information. 

Corollary 1.3. The height Hm of a random recursive tree on m nodes satisfies E Hm = e log m — 
I log log m + 0(1). Furthermore, for all d < all 1, k ^ 1, 

F{\Hm--M.Hm\^k} <e'e-^'^ 

Since the proof of this corollary is very short, we include it in the introduction. In the proof we 
write har(s) = J2i=i ^A- 

Proof The random variable S{wm) is distributed as the sum, Fi + . . . + Fm-i, of independent 
random variables with Fi having Exponential(i) distribution for z = l,...,m — 1. Equiva- 
lently, S{wm) is distributed as the maximum of m — 1 iid Exponential (1) random variables. Thus, 
E S{wm,) = har(m — 1) and for all x > 0, 

F {S{wm) ^ har(m - 1) + x} ^ (m - i)e-(har(m-i)+x) ^ ^-x^ 

F {S{wm) ^ har(m - 1) - x} = (l - e-(ha'-('"-i)-^)) ^ e'^^'' . (1.2) 

Now write 

~ 3 
d{m) = max{77, : M„ ^ har(m — 1)} = elogm — - log log m + 0(1), 

and note that Md{m) = har(m — 1) + 0{1) by Theorem 1.1. It follows that for ^ 1, if H^ ^ 
(i(m) + k then either 

k ~ k 
Md{m)+k ^ har(m - 1) + ^ ^ ^<i(m)+fc " ^ + 

or 

k 

S{wm,) > har(m " 1) + ^• 

By Theorem 1.2 and (1.1), it follows that F {Hm ^ d{m) + k} <ci e^'^i'^/^^e) for each ci < 
e. A similar argument using Theorem 1.2 and (1.2) shows the bound F{Hm ^ d{m) — k} <^C2 

g-C2fe/(2e) foj. C2 < 1. □ 
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Another important example of a displacement vector with exponential steps arises from a discrete 
time random fragmentation process. Let C/i , C/2 , . . . be independent uniform [0, 1] random variables. 
Set Gi = Ui and for z > 1 set = (1 — Ui) • . . . • (1 — Ui-\)Ui. The distribution of the sequence 



was first studied, in greater generaUty, in [Hal44]. (One motivation for Halmos' paper was a problem 

about loss of energy of neutrons after many collisions; after each collision the neutron loses a ran- 
dom fraction of its current energy.) G is also a special case of the Griffiths-Engen-McCloskey GEM 
distribution. Further, Go.(2), • • •) has the Poisson-Dirichlet (or PD) distribution, where 

(7 : N — )■ N is the permutation that arranges the terms of (Gi, G2, . . .) in decreasing order. (We 
remark that both the GEM and the PD distributions as defined above are in fact special cases from 
a more general two-parameter family of distributions [Pit06] - in the standard notation, we are 
considering the GEM(0, 1) and PD(0, 1) distributions.) The PD distribution arises in a number of 
natural decomposition situations, such as factorization of large random integers [Bil72, DG93] and 
cycle lengths of random permutations [Pit06]. 

Letting = — log Gk for each k yields a vector (Xi, X2, . . .) with exponential steps. We refer 
to the resulting branching random walk as a Poisson-Dirichlet branching random walk. This exam- 
ple has more complicated dependence between the than the first example. Since Yli^i Gi = 1 
almost surely, there is another way to think of the branching random walk. Imagine that an object of 
mass m is placed at the root 0. The root divides this mass into pieces according to the vector G*^ and 
sends the pieces to its children, sending a mass mG\ to its fc'th child. This rule is repeated recur- 
sively, so each node v sends proportion of the mass it receives to its fc'th child vk. This structure 
is variously called a multiplicative cascade or, more commonly at the moment, a fragmentation 
process [Ber06]. The special case of Theorem 1.1 when T is a Poisson-Dirichlet branching random 
walk is used in [FKLIO] to analyze a tree model related to primality testing, proving heuristic ev- 
idence for the behavior of the distribution of tree heights. In this special case of a PD branching 
random walk, a much stronger estimate for the right tail of Mn was proved in [FKLIO], namely for 
any C3 < 1, 



where C4 is a constant depending on C3. Such a right tail bound cannot hold in general; for example, 
for the case of T being a Poisson-weighted infinite tree, we have P{Mi ^ x} = e~^. (It seems 
likely that among branching random walks with exponential steps, the Poisson-weighted infinite tree 
and the Poisson-Dirichlet branching random walk are extremal examples, with the former having 
the heaviest tails for Mn — Mn and the latter the strongest tail bounds for M„ — Mn. However, we 
do not have a precise conjecture in this direction.) 

The Pratt tree for a prime p has root p whose children are the prime factors of p — 1; the subtrees 
of the children of the root are recursively constructed in the same fashion (stopping when p = 2). 
We let H{p) be the height of the Pratt tree for p. It is easily seen that the height is always at most 
(log p)/ (log 2) + 1. Such trees were used by Pratt [Pra75] to show that if p is prime, then there 
exists a certificate (formal proof) of the primahty of p, of length 0[H[p) logp) = 0((logp)^). It is 
then of interest to understand the "typical" behavior of H{p). [FKLIO] uses Theorems I.I and 1.2 
to support the following conjecture. 

Conjecture 1.4 ([FKLIO], Conjecture 3). There exist constants c,(/ > and real numbers {E{p) : 
p prime} such that 

• H{p) = e logp - I log logp -I- E{p), 



G = (Gi,G2,...) 
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• for all z ^ 0, and x ^ 0, 



e ^'^tt(x) <C I {primes p ^ x : E{p) ^ z}\ <^ e '^^7r(x), 



and 



Kprimesp ^ x : E{p) ^ —z]\ <C exp(— e'^'*)7r(x). 



Here ■k{x) is the number of primes which are at most x. 

The structure of the remainder of the paper is as follows. In Section 2 we introduce a little 
additional notation. In Section 3 we use straightforward calculations to prove weak bounds on the 
likely value of M„, and to "reduce the search space" of nodes in T„ which have a chance of attaining 
the minimal displacement M„. Section 4 studies the sample path properties of a uniformly random 
element of certain "homogeneous" subsets of r„, and forms a key step of the proof. In Section 5 
we prove the lower bound of Theorem 1.1, and in Section 6 we prove the upper bound. Finally, the 
details of the proof of Theorem 1.2 are found in Section 7. 



Given v = viV2 ■■ - Vn £ we let h{v) = Y^^=i'^i^ ^^^^ remark that S{v) has distribution 
Gamma(/i(i;)). If f G T„, we write k{v) = h{v) — n, and write T„ ^ for the set of nodes f G T„ 
with k{v) = k. We denote by r„(x) (resp. Tn^k{x)) the set of nodes of r„ (resp. T„,fc) with 
displacement at most x. 

The Bachmann-Landau notations o() and 0() have their usual meaning. As mentioned earlier, 
we use the Vinogradov notation f <^ g which means / = 0{g). We also use the Hardy notation 
f >^ g which means / = 0{g) and g = 0(/). Constants implied by these symbols are absolute 
unless otherwise indicated, e.g. by a subscript. 



In order to restrict the set of nodes we need to consider when searching for the precise location 
of we first assert the following two straightforward facts, whose proofs are forthcoming. 

Lemma 3.1. (a) The expected number of nodes u G r„ with \h{v) — (1 + l/e)n| ^ ^/n and with 
S{v) < n/e + logn/(2e) is » 1. 

(h) The expected number of nodes v ETn with S{v) ^ n/e + (2/e) \ogn and with \h{v) — (1 + 

l/e)nj > Y^6n log n /i' 0(n^^/^). 

Together, (a) and (b) suggest that in order to find Af„, it should suffice to look at nodes in T„ 
satisfying h(v) = (1 + l/e)n + 0{^/n), as will indeed be the case. In proving (a) and (b), we will 
in fact prove more general bounds that will be useful throughout the paper. 

We first remark that forv eV with h{v) = h, S{v) has density function 



For all n ^ 1, A; ^ 0, we have 



2. Notation 



3. Some basic expectations 




(3.1) 



so the sum of the density functions for nodes v G jt is 
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This function will play a significant role, and we now derive bounds on its value for a variety of 
ranges of k and x. We remark that assertions (a) and (b), above, state in particular that to find M„ 
we should take both A; and xnearn/e. Thus, writing A; = (n+r)/eandx = (n+y)/e, by StirUng's 
formula we have 

n + y \n + r \ n + rj \ nJ iir 

When r = 0{^/n), y = 0{y/n), we have (1 + yjnf'' x and 

(1 - (r - y)/(n + r))("+'')/^ x e-^^'V)!^^ 
and so obtain the simpler approximation 

n 

Consequently, 

/•(logn)/2 y 

E|{i;Gr„,fe:5(i;)^(n+|logn)/e}|x / - x n'^/^ (3.3) 

JO ^ 

for any fixed k = n/e + 0{^/n) - where the constants impUcit in 0{^/rl) and in (3.3) may depend 
on each other - and so we obtain 

IE|{w G T„,fc : S{v) ^ {n+llogn)/e,\k-n/e\ ^ > 1. 

This justifies claim (a) of Lemma 3.1, and we now turn to Lemma 3.1 (b). The next lemma is 
[FKLIO, Lemma 5.1], and we give a different proof below. 

Lemma 3.2. For all n and x ^ 0, 

E|rn(x)| = — . 

n! 

Proof. We have 



E 



fX n 

\Tn{x)\ = Y.T. nsiy) ^x} = Y, u,u{t)dt = -. □ 



It follows immediately from Lemma 3.2 and Stirling's formula that the median of Mn is ^ 

f + ^logn + 0(l). 

We next obtain bounds on the probability that k is very different from x when x ^ n/ (2e). First 
we quote easy bounds for the tails of the Poisson distribution. 

Proposition 3.3. Ifz>0 and < a ^ 1 ^ (3 then 



Ez^ / e \ z'^ f eV 



az 



k^az k^0z 

Proof. We have 

^ kl ^ k\ [aj ^\aj ^ k\ \a 

k^az k^az k^az 

The second inequality follows in the same way. □ 
An easy corollary is the following. 
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Lemma 3.4. ForO^t^ x^/^, 



^ fn,k{x) < e" 

{k:\k-x\^t^} 



e/2_ 



„n-l 



(n-1)!- 



Taking t = [-y/51og n] and integrating the above bound over n/e ^ x ^ n/e + (2/e) logn, we 
obtain the bound 



E 



e 



S{v) ^ " + \h{v) - Siv)\ ^ 



n 



O 



1 



n 



1/2 / • 



Since -v/5nlogn+ (2/e) log n < \/6nlogn for n large, combining the preceding expectation bound 
with Lemma 3.2 (applied with x = n/e) and StirUng's formula it follows that 



El 



(j T„,fc((n + 21ogn)/e) 

{k:\k—n/ e\^^/(inlogn} 



o 



n 



1/2 y ' 



which establishes Lemma 3.1 (b). 



4. Randomly sampled random walk 

For integers n > 1, ^ and a vertex v = vi - ■ -Vn € T„_fc, let hi{v) = h{vi . . .Vi) and 
Wi{v) = S{vi . . .Vi) for I ^ i ^ n, and write W{v) = {Wi{v)', Wn{v)). We write W, W, 
and hi in place of W(f ), Wi{v) and /ij(f ) when v is clear from context. We will always write v„ 
for a uniformly random element of fc, independent of v„/ for (n, A;) / {n', k'), and write Wn,fe 
for the distribution of the sequence yV(v„ fc) = (VFi(v„ fc), . . . , Wn{vn,k))- Although the sequence 
0, Wi , . . . , Wn is not a random walk, it is useful to think of it as such for the purposes of estimating 
various probabilities. 

Denote by 'Hn,k the set of vectors {hi,. . . , hn) of positive integers with < /ii < • • • < /i„ = 
n + k and note that \T-Ln,k\ = The sequence (/ii(v„,jk), . . . , hn{vn,k)) is distributed as a 

uniformly random element of 'Hn,k- 

For V G T„, let = La{v) denote the event {Wi ^ {i/n)Wn — a{i ^ n)}. A vertex v is called 
leading if Lq{v) holds, and - informally - near-leading if La{v) holds for some small a. (We also 
will need to consider the event Ra{v) = {Wi ^ (i/n)W„ + a (z ^ n)}, and when this event occurs 
we say v is "near trailing".) 

If Mn is not much larger than normal, v is the vertex at level n with minimal S{v) and Wi ^ 
(i/n)Wn — c for a large c, then Mj will be smaller than normal and this is rare. Hence, with high 
probability v will be a near-leading vertex. On the other hand, near-leading vertices are rare - a 
given vertex in T„ is near leading with probability 0(/(a)/n) for some function /. It will turn out, 
as in prior work [ABR09], that E M„ is within 0(1) of the smallest x such that the expected number 
of leading nodes with displacement at most x is at least 1. 

In this section, we develop estimates for the probability that vertices of T„ ^ are near-leading. As 
in [ABR09], we also show that for a near-leading vertex v, it is rare for Wi{v) — {i/n)Wn{v) to be 
small if i is far away from and far from n. This useful fact will play an important role in the proof 
of Theorem 1.1. 

The next proposition, stated without proof, follows from the well-known fact that a Poisson 
sample becomes a uniform sample once conditioned on the position of the nth point. 
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Proposition 4.1. For any positive real numbers 61, . . . , 6„ and B, and any v G T„, 

P(W, ^h{i< n)\Wn = = P ^ I (z < n)^ , (4.1) 

P(VF» ^hi{i< n)\Wn = i?) = P ^ I (i < n)^ . (4.2) 

Proposition 4.1 allows us to rescale the values Wi to choose a convenient value for Wn- for given 
S', letting b[ = bi- B' / B, the proposition implies that 

^{Wi ^bi{i< n)\Wn = B}= F{Wi ^ b'^ {i < n)\Wn = B'}. 

We will use this fact rather casually in what follows. We will also use the following variant of a 
well-known fact about cychcally exchangeable sequences. 

Proposition 4.2. For any S > 0, 

P{^o(v„,fc)|H^„ = S} = P{i2oKfe)|W„ = S} = ^. 

Proof. For ^ Z < n, let Wn+i = Wn + Wi. Then, for each ^ Z < n and all < j ^ 
n, let wf'' = Wjj^i - Wi. Then for all I, Wn"^ = Wn- Furthermore, each sequence W^'^ = 

{wf^ , • • • , Wn'^ ) has distribution >V„.fc and a.s. exactly one of them is leading by the Cycle Lemma 
[DM47]. Similarly, exactly one of the sequences W^'^ is "trailing". □ 

The following straightforward fact essentially says that conditioning on any subset of the differ- 
ences hi — ho,...,hn — hn-i breaks the sequence into independent subsequences with distributions 
from the same family. The proof is omitted. 

Fact 4.3. Fix integers n ^ 1, k ^ 0, and let {Wi, . .., Wn) have law Wn,k- Then for any integers 
1 ^ i ^ m ^ n, and 1 = no < ni < . . . < = n, conditional upon hi — hi-i, the sequence 

{Wn,_,+l-Wn,_„...,Wn,-Wn,-l) 

has law Wn,-n,^i.{iH-fH^i)-{n,-n,^i), and is mutually independent of {hi, hn), of {Wi, Wn^.j), 

and of {Wn,+1 -Wn„...,Wn- Wn-l). 

The next two lemmas are analogs of Lemmas 11 and 12 in [ABR09], and are proved using 
some of the same ideas. Whereas lemmas in [ABR09] use heavily the fact that a random walk 
0, Si, . . . , Sn can be broken into independent sub- walks 0, Si, . . . , Sj andO, Sj-^-i—Sj, . . . , Sn—Sj, 
in our situation the analogous subsequences 0, 14^1, ... , Wj and 0, Wj+i — Wj, . . . , Wn — Wj are 
not independent. We circumvent the lack of independence by instead using Fact 4.3. 

Lemma 4.4. Uniformly for S>0, O^k^n and a ^ 0, 

^{La{Vn,k)\Wn{^rn,k) = S} « (W-^)' + 1 ^ 
^{Raiyn,k)\Wn{Vn,k) = S} « ^ ^ 

Remark: Most likely, the exponent "6" can be replaced with "2", in analogy with results from 
[ABR09] about ballot theorems for random walks. 

Given that La{^n,k) holds, it is likely that Wi — {i/n)Wn remains large when i is far from 1 and 
far from n. It is also likely that hj is not too large when j is small, and similarly hn — hj is not large 
when j is near n. The next two lemmas make this very precise. 
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For V £Tn, define the events 

Ba{v) = {3m G [a^°, n - a^°] : Wm{v) ^ {m/n)Wn{v) + mm(m, n - m)^/^°} 

and 

Da{v) = {3j : hj{v) > 3aj or hn{v) - hj{v) > 3a{n - j)}. 
Lemma 4.5. Uniformly for < fc < n/2, n/10 ^ S and a ^ 1, 

Lemma 4.6. Uniformly for ^ k ^ n/2, n/10 ^ S ^n and a ^ 0, 

P{i^aKfc),I?aK,fc)l^n(v„,fe) = 5} < — . 

n 

Proof of Lemma 4.4. It suffices to prove the lemma when a ^ 10. We also assume a ^ n^/^, or 
else the conclusion is trivial. Finally, in light of Proposition 4.1 we may assume without loss of 
generaUty that S = n + k, so that n ^ S ^ 2n. 
Let 

2 , r7 _ /„n / „ , r> _ 7/ I , oi \ n' + k' I anX 



m = a , I = [km/n] , n' = n + 2m, k' = k + 21, A = 



a = 



S 

We remark that m, I ^ n^/^, aA/2 ^ a' ^ aX, and for n large enough 1 ^ A ^ 3. 
By Proposition 4.2, 

A ■.= F{Lo{^rn',k')\Wn'{yrn',k') = An'} = ^. (4.3) 

Now let W = {W[, . . . , , W^,) be a sequence with law W„',fc'- We bound A from below by 
counting only sequences with h'^ = m + l and = (n' + A;') — (m + In this way, we can 

break W' into three subsequences, namely 

W, where Wj = hj = h'j (1 ^ i ^ m), 

W, where Wj = T^j+^ - W;,, hj = h'j^^ - h'j {1 ^ j ^ n), 

W, where Wj = W^, - W'^,_j, hj = n' + k' - h'^,_j {l^j^m). 

That is, W captures the first m steps, W the next n steps, and W the last m steps taken in reverse 
order. 

We'll work with four events: 

Ei = {h'^ = m + I, h'^,_^ = {n' + k') - (rn + I)}, 

E2 = {Wj ^ Xj {j ^ m), Wm-Xme [a', 2a']}, 

Ez = {Wj ^ Xj {j ^ m), Wm-Xme [-3a', -2a']}, 
E^{x) = {Wj ^Xj-x {j < n)}. 

Given Ex, W and W have law Wm,i, and W has law yVn,fc> and all three are independent. Also 
given El, the events E2, E^ and £'4(.t) are independent. Thus, 

A ^ F{Ei I VF;, = Xn}F{E2\Ei.W'„, = An }P{^3l^i, W'^, = An'} 

inf F{E4{x)\Ei, W^, = An', Wn = Xn - x - y}. (4.4) 
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Since m + I = 0(n^^^), if k > then a slightly tedious but routine computation with Stirling's 
formula and (3.1) gives 

I W, ^ A„'> - mm X ^' - 'V..^ \. (4.3) 



/n+2m+2l+k-l\ \ I J (ji J^. ]^ym+2l I 

When /c = 0, trivially | W^, = An'} = 1. For the remainder of the proof we write P'^ {•} to 

mean P{- | W'^, = An'}. Next, 

r {^2} = P' \Wj ^ Aj (j < m)|H^^ - Am G [a, 2a']} P^ - Am G [a', 2a']} . (4.6) 

Given that W^, = An' and /i^ = m + Z, has distribution 

An' • Beta(m + Z, n + A; — m — Z), 

and in particular has mean 

(m^ \ 
— j = Am + 0(1) 



and variance 



TvJT ^ = 0(m). 

^ ^ (n + A;)2(n + A: + 1) ^ ^ 



Since a' ^ | ^ \\P^^ it follows from the definition of a Beta random variable that the second 
probability on the right side of (4.6) is S> 1. Applying Proposition 4.1 followed by Proposition 4.2, 
the first factor on the right side of (4.6) is 

^ inf Y'^j ^ Aj (i < m)|T%„ = Am + x} 

^ inf P{ivo(v^i)|iy^(v^i) = Am + x} = — . 

a'^a;^2a' m 



Therefore, 
Similarly, 



P^{£;2}» = ^. (4.7) 



m a^ 



P'={£;3}> inf P{Lo(v,„,0|W^m(v™,0 = Am + y} = - = 4- (4-8) 

-3a'^?;^-2a' ma"' 

Lastly, for a' ^ x ^ 2a' and —3a' ^ y ^ —2a', Proposition 4.1 yields 

r^{E^{x)\Wn = An-x-y} = p|^^ ~ (j ^ n) 

yWn Xn-x -y 

= F{La{Yn,k)\Wn{Vn,k) = S} . (4.9) 

Together, (4.3)-(4.9) imply 

^ > \r{La{^r^,k)\Wn{^n,k)=S}, 
n a" 

which proves the first assertion of the lemma. The proof of the second part is identical. □ 
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Proof of Lemma 4.5. Fix k, S, and a as in the statement of the lemma. We write Wm = l^m(vn,fc), 
hm = hjn{'Vn,k) SO on. If > n/2 then there is nothing to prove so we assume ^ n/2. 
For ^ m ^ n — a'^^ and I ^ 0, let 

Am,i = P {i^a(vn,fe), ^^^'^ min(m, n - m)^/^° = S, /i^ = m + /} . 

Break (VFi, . . . , into two sequences: Wj = Wj for j ^ m, and Wj = Wn — Wn-j for 
j ^ n — m (the latter being the final n — m steps taken in reverse). Given hm = hm — ho, these 
sequences are independent by Fact 4.3. We write P*^ {•} for the conditional probability measure 
P{'|/im = m + I}, and E'^ {•} for the corresponding expectation operator. Also, let A = 

Suppose first that a^o ^ m ^ n/2. Put6 = m^/"^"^ and a' = a^. Note that E'^ ^Wm \ Wn = 

S ■ {m + l)/{n + k). Rescaling by {n + k)/S (this is allowed by the comment just after Proposition 
4.1), by the definitions of b and a' we have 

Am,i ^ P' {Wm - Am e i-a', b]\Wn = An} 

sup P'^ \Wj ^ Xj - a' {j < m)\Wm = Xm + x} 

sup F"" \Wj ^ Xj + a' {j < n - m)\Wn-m = X{n - m) + x} . (4.10) 



X 



X 



Given that Wn = Xn and km = m + I, Wm has distribution An • Bcta(m + l,k + n — m — I) and 
so the first factor on the RHS of (4.10) is 0{b/^/m,) uniformly in I and in n. Applying Proposition 
4.1 and the first inequality of Lemma 4.4, the second factor on the RHS of (4.10) is 

{ Wj j a' + bj/m , ,1 
\Wm mA + 6 ' J 

= P{L„,+6(v^,0|Ty„(v^,/) =mA + 6}< ^ < -, 

m m 

so the product of the first two factors on the right-hand side of (4.10) is 0(6^m~^/^). Similarly, by 
Proposition 4.1 and the second inequality of Lemma 4.4, the third factor on the RHS of (4.10) is 

\ Wn-m n-m X(n-m)-b 

Combining these bounds, we obtain that when a^^ ^ m ^ n/2, Am,i ^ 6^^/(nm^/^). The 
estimation of Am,i with m > n/2 is identical, by reversing the roles of W and W. Therefore, 

P(La(v„,fe),S„(v„,fe)|W-„ = 5)< Y.^{hm=m + l}Am,i 

1 ^13/40 1 

<- V ^7^<— • ° 
n m'^/^ na' 

Proof of Lemma 4.6. As before, we write = W„(vn,fe), /ij = hj{vn,k) and so on. We may 
assume a ^ 10, or else the conclusion follows from Lemma 4.4. We also assume A; ^ 1, or else 
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hj = j for every j and Da{^n,k) is impossible. For fixed j, given hj the sequence {Wi, . . . , Wn) 
breaks into two independent sequences W, consisting of the first j steps, and W, consisting of the 
last n — j steps taken in reverse. If Wn = S and La{yn,k) holds, then there is an integer b ^ —a — 1 
so that Wj - i-^S e[b,b+ 1]. Consequently, Wn-j - G [-b - 1, -b]. 

Fix h such that h > 3a j and suppose that hj = j - note that in this case j < ^ Given 
that hn = h and Wn = S, Wj has distribution S • Beta(/i, n + k — h). Since k ^ n/2 and ^ n, it 
is then straightforward to check that F ^Wj b \ hn = h,Wn = ^ e~''/^ for b ^ Ah. We also 
have 

F {Wi ^ -S - (a + b) (i ^ n - j) Wn-j - ^^^S G [-b - 1, -6], /i,- = h,Wn = s\ 
y n n J 

^ F I L2a+biyn-j,k-h+j) Wn-j{Vn-j,k-h+j) —S G [-6 - 1, -b] > 

^{2a + bl 
n 

by Lemma 4.4 if 6 ^ n^/^, and trivially otherwise. Summing on b, we find that 

F {La{yn,k)\hj = h,Wn = S} « ^ > + ^ ^ « 

Note that (/ii, . . . , /i„) is independent of Wn and so P {hj = h \ Wn = S] = F {hj = h]. Since 
h — j ^ k ^ n/2,hy Stirling's formula 



'{hj = h}=(^ 



, i\ (n+k-h-l\ 
h — l \ \ k-h+j I 



k-h+j 



^ h^ {n-l)---{n-j)-k---{k-h + j + l) 
^ j\ {n + k-l)---{n + k-h) 

the last inequality holding at least for a ^ 5 (which we have assumed). Summing over h > 3aj, 
then over j, we find that 

F{La{Yn,k), 3j : hj > 3aj \Wn = S} ^ « ^"^'^^^ 

Next, suppose h = hn — hj > 3a(n — j), in which case n — j < Let 6' = Wj — ^S. Since 
Wj+i ^ for all i, Wj ^ 5 and so b' ^ ^^'S' ^ n — j. Also, in order for Ln^k{o) to occur we 
must have h' ^ —a. Thus, writing X = [—a, n — j\, and ignoring the last n — j steps of W for an 
upper bound, we have 

F {La{yn,k)\K - hj = h,Wn = S} 

^ supF {Wi ^ -S -a(i^ j) W = + b', hn-hi = h\ 
b'ei [ n ' n J 

^ SUpP i La+b'(yj,n+k-h) Wj{vj^n+k-h) = -S + b'\ , 

b'ex L n ) 
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Note that a ^ n/2 (or else 3a > 3n/2 > n + k and Da{^n,k) is impossible). Since j ^ 
and 6' ^ — a ^ —n/2, by Lemma 4.4 and straightforward manipulations the last probabiUty is 

0(^(a + b'f) = 0(^(a + n - j)^). Also, ¥ {K - hj = h} = F{hn-j = h} < e'^l'^ by the 
same calculation as in (4.1 1). Summing over h > 3a(n — j) and j ^ n — 1 gives 

P{^a(v„,fc), 3j : hn - hj > 3a(n - j) |W„ = S} < 

Together with (4. 12), this completes the proof. □ 

5. The lower bound in Theorem 1 . 1 

We continue to adopt the notational conventions from the previous section. Let c be a sufficiently 
large positive constant, and b = e"^/^. Let 

\k—n/e\^^/6nlogn 

and put m„ = f + ^'gf ^ nT-n — c, then one of the following must occur: 

(i) For some v G r„, S{v) < m„ — log n; 

(ii) For some k satisfying \k — n/e\ > ^/Wn\o%n and some v G Tn^k, S{v) ^ m^; 

(iii) For some v G y„, m„ — logn ^ 5(1;) ^ m„ and Wj ^ {i/n)Wn — logn for some i; 

(iv) For some v G y„, m„ — logn ^ S'(v) < m„ — c and ^ {i/n)Wn — b for all z; 

(v) For some v ^ Yn and some integer a G [6, logn + 1], m„ — logn ^ S{v) ^ m„, Wj ^ 
{i/n)Wn — a for all i and Wj < {j/n)Wn — (a — 1) for some j (write F^j for the event 
that this occurs for a given a and j with j minimal, and note that these events are disjoint). 

By Lemma 3.2 and Stirling's forumula, the probability of (i) is at most E{T„(m„ — logn)} = 
0{n^-''). The probability of (ii) is 0{n-^/'^) by Lemma 3.1 (b). If (iii) occurs, then Mi ^ 
{i/n)mn — logn, and this happens with probability at most E{Tj((i/n)m„ — logn)}, which is 
0(n^/^~^z^^/^) by Lemma 3.2. Summing on i, we find that (iii) occurs with probability 0{ii?^^). 

To bound the probabihty of the event in (iv), we write Ek for the event that there is u G Tn^k for 
which rUn — logn ^ S{v) ^ m„ — c and Wi ^ {i/n)Wn — b for all i, so that by a union bound 
and Lemma 4.4, the probability of (iv) is at most 

E ^{^*^} 

|fe— n/e|^\/6nlogn 

^ E |r„,fc| P {mn - logn ^ 5(v„,fc) ^ mn - c,Lb(v„,fc)} 

|fe— n/e|^\/6nlogn 

^ E |r„,A;| IP {"T-n - logn ^ ^(v^ fc) ^ mn - C} • — 

■"^ — ' n 

n/e|^v'6nIogn 

^ - • E{r„(m„ - c)} 

< e(2-^)^ (5.1) 

(This line of argument will arise again in bounding (v), and we will omit the details.) 

Finally, we bound (v). To do so, we are forced to separately treat j in three different ranges. First 
suppose j ^ a^*^. If j occurs then Mj ^ ij/n)mn — a, the probability of which is 0( j~^/^e~^") 
by Lemma 3.2. Summing on a and on j ^ a^'^ gives a total probability of 0(e~^^) for this range of 
parameters. 
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Next suppose that < j < n— a^°, so that (min(j, n— ^ a. If Fqj occurs then for some 
V e Yn, La{v) and Ba{v) both occur. Note that for n large enough n/10 ^ rUn — log n ^ ^ n, 
and for all k for which r„ C we have ^ k ^ n/2. Thus, for such n, A;, and a, we may apply 
Lemma 4.5 to see that 

^{La{'Vn,k),Bai-Vn,k) \ - log n ^ W„(v„,jt) ^ m„} < 

Further, the expected number of w G y„ with S{v) ^ m„ is 0(n) by Lemma 3.2. By these two 
bounds and a reprise of the argument leading to (5.1), we see that for a given a, the probability of 
Uje[a4o n-a^o] 0{l/a^) and summing over integers a G [6, log n+1], gives a total probability 

of 0(1/6^) = 0(e-2^). 

Now suppose Fa J- holds with j G [n — a^°, n] and a G logn + 1]. By the definition of Faj, 
letting w be the unique ancestor of v in Tj, the event La-\{w) also occurs. Since j ^ n — (log n + 
1)^°, for n sufficiently large \mj — {j/n)mn\ ^ 1 and hence S{w) ^ rrij + 1 — {a — 1). On the 
other hand, for any integer k' ^ 1, by Lemma 4.4 we have 

P ^ + 2 - a, L„_i(vj- fcO} < y ^ {^iKfc') ^ m^- + 2 - a} 

By Lemma 3.2, it follows that 

^{Fa,j} < y ^ I^jK- + 2 - a)| < a^e-"". 

Summing first over j G [n — a^°, n], then over a G [6, logn + 1], we see that the probability Fa^j 
occurs for any a and j in this range is 

< 646g-e6 ^ exp{(46/3)c - e^+"/^} < e''^", 

as long as c is large enough. Combining the three ranges, we obtain that (v) occurs with probability 
^ e"^*^. Altogether, the probability that one of (i)-(v) holds is ^ e^^"^)'^, which is less than 1/2 if 
c is chosen large enough. Hence, M„ ^ m„ — c. 

6. The upper bound in Theorem 1.1 

For the upper bound for median(M„), we use a second-moment method. By the Cauchy-Schwarz 
inequahty, for any non-negative random variable X, 

nx > 0} ^ ||f . (6.1) 

When X is the size of some random subset A" of a ground set Vq, we may rewrite (6.1) using the 
fact that 

EX2= ¥{veX,weX}=YK[X\veX]¥{veX}, 

v,w&Vo v&Vo 

so that 

[EX]2 EX 
^ ^ ^ T..eV,nX\veX]nv^X} ^ sup„,v„E[X| t;G;f]- ^ 

Let a be a large positive constant. Let Vb = where is defined as in the previous section, 

and let X be the set of nodes in f G l^n satisfying 

(i) mn - I ^ S{v) ^ nin, 

(ii) La{v), 

(iii) neither Ba{v) nor Da{v). 
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Taking X = |Af |, by Lemma 3.1 (b), plus Lemmas 4.2, 4.5 and 4.6, we have 

EX ^ E[r„(m„) - T^m. - 1)] Q - O - O - O » 1 (6.3) 

if a is chosen large enough. 

Recall that for all v G 1^, \k{v) — n/e\ < -\/6nlogn. For fixed v G 1^, we need to estimate 

E{X \ v£X}. 

The definitions of the coming two paragraphs are for the most part depicted in Figure 1. Write 
j = j{v,v') for the integer ^ j < n such that v and v' are descendants of two distinct children 
of some node w = w{v, v') G Tj (and let j{v, v') = n if v = v'). In other words, j{v, v') is the 
generation of the most recent common ancestor of v and v' . Supposing ^ j{v, v') ^ n — 1, let x 
be the unique child of w on the path from wtov'. 

_Also^write W = W{v) and W = {Wi, . . . ,W^) = W{i/). Letj = n-- {j{v, v') + 1), let 
Wi = Wi{v, v') = - Wy'+i for 1 ^ z ^ and let W = (1^1, . . . , Wg), so in particular 

Finally, let let k' = k{v'), let ki = ki{v,v') = k{x) — k{w) and let k2 = k' — k{x), so 
k\ + k2 = k' — k{w). Note that once g and /c2 are fixed, W is independent of W and has law 'VVg^k2 • 

For integers j, ^ j ^ n, let J^j = Tj{v) = {v' G X,j{v,v') = j} and let Fj = Fj{v) = 
E{|7v, | I V G X}. Clearly, F„ = 1, as j — n implies v = v' . 

Now fix v' . If v' G X, then by (i), (ii) and (iii), we have 

k{x) ^ 3a(j + 1), k' - k{w) ^ ^a{g + 1) = 3o(n - j), 
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and so 

ki + k2^Sa{n-j), fci ^ min (3a(j + 1), 80(5 + 1)) (6.4) 
and ifveX then, with j = j{v,v'), we have 

j, , I (—a) whatever the value of 7 

Wj > Hmn - 1 + r . . .u/40 -f 40 ^ ■ ^ 40 (6.5) 

Consider separately four ranges of j. First, if n — ^ j ^ n — 1, then for sufficiently large n, 
(6.4) imphes that ki + k2 ^ 3a{n — j), so Fj is deterministically at most 

Hence, recalUng that a is now a fixed, large constant, 

E « 1. (6.6) 

Next, let r = (2 log n)^°. lfn — r<j^n — then for n sufficiently large, j ^ n — j = ^ + 1, 
and (6.5) imphes that in order to have ^ run we must have 

W, ^ - ^V40 + i^g/e- ^1/40 + 2, 

n 

the second inequality holding for sufficiently large n. For fixed ki, by Lemma 3.2 we thus have 

E {|{^;' G Af,i(t;, v') = j, ki{v, v') = ki]\ | G Af} ^ ^Tg{g/e - 5^40 + 2) 

Using (6.4) to bound ki and summing over j yields 

E E a(9 + l)exp[-e5i/^0]«l. (6.7) 

Next, suppose r ^ j ^ n — r. By (6.5), in order to have ^ m„ it must be that 

Wg ^ ^^rrin - min(j, n - jf/^^ + 1 ^ ^ - logn. 
n e 

Since we also require ki{v, v') ^ 3an by (6.4), we have Fj ^ 3an'E,Tg{g / e — logn) <^ 1/n^ for 
this range of j, and hence 

Finally, suppose ^ j ^ r. Here — r — l = n + 0((logn)^°), and since La{v) holds by 

assumption, ifv ^ X then 

^ - a > - (a + 1). 

n n 

For each integer 6 G [—(a, + 1), 2 log n), let E^, be the event that Wj — {j /n)mn G [&, ^ + 1)- Also, 
let £■* be the event that Wj - {j/n)mn ^ \2 log n] . The events {£^6 : -(a + 1) ^ 6 < 2 log n} 
and E* together partition the event {v' G J^j{v)}, so by conditioning 

Fj ^ma.x(E{\J='j\\v e A:,E*} , max E {{J^jl \ v e X , Eb}] (6.9) 

\ — (a+l)^6<21ogn / 
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If Wj ^ (j /n) mn + 2 log n, then to have G J^j, we must have ^ g/e — \ognso, as in 

the case r ^ j ^ n — r,v/e have 

E{|J-,-| \ v^X,E*}^ ^. 

Now suppose Wj — {j/n)mn G [6, 6 + 1], where b is an integer satisfying —a — 1 ^ b ^ 21ogn. 
Note that if 6 < {j^^^^ — 2) and a*^*^ ^ j ^ r then J-'j{v) is necessarily empty due to Ba(v), 
so for such j and b, \ v e X,Ei,} = 0. For the rest, we further subdivide Tj, writing 

Tj^i = {v' e X,j{v, v') = j, ki{v, v') = I}. By (6.4) we have 

K3a{j+1) 

Suppose additionally that — Wj{v') e [A, A + 1], where A is a non-negative integer. 

Since Wj{v') = Wj{v), in order to have v' G J-'j, by (i) we require'^ 

Wg - ^run G [mn/n - (6 + A + 3),m„/n - (6 + A)]. 

Since ^ mn/n < 1 and, for n sufficiently large, — 1 ^ {g/n)mn ^ nig, this implies that, 
writing I = [-{b + A + 4), -(6 + A - 1)], we must have 



Wg — nig e I. 



By (i) and (ii), we also require 



Wi ^ -Wn -b- A- -m„ - 6 - A - a - 3 (i ^ g). 

n g " 

This implies that for all i ^ g, 

Wi ^ ^-Wg - max(6 + A + a - 3, a - 2). 
None of this depends on I, so for any 0^1^ 3a(j + 1), writing m = max(6 + A + a — 3, a — 2), 

l^fe2<3a(j+l) 



< E (K{r9,fe2K-(& + A-l)} 



Kfe2^3a(i+1) 

• supP{Lm(vg,fc2) I Wg{-Vg^k2) = nig + x] 
xex 

/ /, . max(6 + A + a — 3, a — 2)^ 
E{r,K-(6 + A-l))}.j ^ ^ L 

A^O 

V- ^^-e{b+^) . • . max(b + A + g - 3, g - 2)6 

A^O 

«ie->+|6|)6, 



^The mn/n terms come from the "skipped step" from Wj to Wj+i, and the (6 + A + 3) comes from 6 + 1, A + 1, 
and the requirement that S(v) ^ m„ — 1. 
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the third-to-last line by Lemma 4.4 and the second-to-last by Lemma 3.2. Summing over ^ / ^ 
3a{j + 1), it follows that 

E{|J-^-| I V £ X,Eh} <:fe-''\a+\b\f. 

For j ^ a^^ this is 0(1) uniformly in b. When j > a*^ we also have 6 ^ j^/^*^ — 2 and for such j, 
the above bound is 0{j^e~^^^^'^°). By (6.9) it follows that for such ^ j ^ r, 

fl ifj^a^o 

1 max(n~^, j'^ exp(— ej-*^/"^*^)) ifj>a^°. 

Summing on j, we find that 

^ Fj « L (6.10) 

Together, (6.6),(6.7),(6.8), and (6.10) imply that for every veT, 

E[X :veX] = 0(1). 

Combining this estimate with (6.2) and (6.3), shows that F {X > 0} > 1, and if X > then 
Mn ^ m„, so there exists an absolute constant e > such that for all n. 

From here it is straightforward to show that M„ ^ M„ + 0(1), and we now do so. The next two 
lemmas, taken from [FKLIO], are standard bounds for BRW. As the proofs are short, we include 
them here. 

Lemma 6.1. For any BRW, positive integers m, n and positive real numbers M, N, 

F{M.,n+n ^M + N}^ E[(P{M„ ^ iVD^-^^)]. 

Proof. Suppose Mm+n ^ M + N and Tm{M) = k. For each of these k individuals, all of their 
descendants in generation m -I- n are offset from their generation m ancestor by at least N. □ 

Lemma 6.2. Let m, n be positive integers and let M > 0,e > be real. IfF.{{l - £)^'"(^^)} < i 
then P{M„ < -Mn_|_^ — M} ^ s. In particular, the conclusion holds ifF{Tjn{M) < 1/e} ^ |. 

Proof. Let q = sup{x : P{M„ < x} < e}; then P{M„ < q} ^ e. By Lemma 6.1, 

F{Mm+n ^M + q}^E [(P{M„ ^ g})^-^^)] < ^. 

Therefore, M + q^ Mm+n, and thus P{M„ < Mm+n - M} ^ F{Mn < q} ^ e. To prove the 
second part, assume that P{r^(M) < 1/e} ^ |. Then 

E {(1 - efrn(M)^ ^ F{Tm{m) < + (1 - F{Tm{M) < i})(l - e)V- ^ 1 + 1 < 1. □ 
Now take A such that P {Ti{A) < 1/e} ^ |. By Lemma 6.2, 

P {m„ ^ M„ - ^} ^ P {m„ ^ Mn+i - ^} ^ £, 
and hence M„ ^ M„ -|- A, which completes the proof of the upper bound in Theorem 1.1. 
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7. Proof of Theorem 1.2 

Let a > 1/e and < 77 < ae/2. By Biggins' analog of Chemoff's inequality for the BRW 
[Big77, Theorem 2], for large r we have F{Tr{ar) ^ (ae — 77)''} ^ |. Let ro be large enough 

that, in addition, Mn+r ^ Afn + — rj)''^ for all r ^ ro and all n (such an ro exists by Theorem 
1.1). Now fix r ^ ro and let M = ar, let m = r, and let e = (ae — r?)^*^. We then have 
F{Tm{M) < 1/e} ^ 1/5, so for all n, by the preceding bound for Mn+r and by Lemma 6.2, we 
obtain that 

P{M„ ^ M„ - (a - 1/e + r])r} ^ P{M„ ^ M„+^ - ar} ^ {ae - 77)"^ 

The first estimate follows with ci = . Fix a, let 77 ^ 0, then let a ^ 1/e, so that ci e. 

This proves the first part of Theorem 1.2. 

For the second part, fix < e < 1/50 and let 5 = e^, so that 8{l + log((l - e/5)/5)) < e/5. 
Then choose ro sufficiently large that for all r ^ ro, we have (1 — £:/5)r + 2 [log(2r)] < r, and for 
all s ^ log(2ro), we have '¥'{Ts{2s) ^ 4*} ^ e"^/^ (as in the first part, such an ro exists by [Big77, 
Theorem 2]). 

Recall that if /i G = Ti is a child of the root in T then S{h) is Gamma(/i) distributed. Thus, 
for any positive integer r, by a union bound 

P {ri((l - £/5)r) ^ 5r - 1} ^ ^ ¥{S{h) ^ (1 - e/5)r} 

h^Sr 

_ ^ e-(^-^/^>((l-£/5)r)^ 

^ g-(l-£/5)rg(l+log((l-£/5)/5))5r- 
^ g-(l-2e/5)r^ 

the second-to-last inequality by Proposition 3.3. 

Write s = [log(2r)] , and let E be the event that there are at least 4^* nodes in Tg+i with displace- 
ment at most (1 — e/5)r -I- 2s < r. If ri((l — e/5)r) > Sr then either E occurs, or else for each 
h ^ \S7- — 1] , the number of i; G Tg+i descending from G Ti with 5(7;) — S{h) ^ 2s is less than 
A". The latter event has probabihty less then (e-(V'5))'5''-i = e^^/'^)"''. It follows that 

F{E^} ^ e-i^-'^-l^y + e(V'5)-'- ^ e-^^--l-^)\ 

the last inequality holding for large r. Finally, if Mn > M^-^s+i) + then for each node v G Tg+i 
with S{v) ^ (1 — £/5)r -|- 2s, for all w e Tn descending from v we must have S{w) — S{v) ^ 
Mn-{s+i)- If E occurs then there are at least 4"^ ^ 2r such nodes v, and so 

P {Mn > Mn-^s+i) +r}^ e-^^-'/^> + 2-^^ < e'^^-'^ , 

the last inequality holding for large r. Since Mn-(^s+i) ^ the second part of Theorem 1.2 is 
proved by letting £ ^ 0. 

Acknowledgements. The authors thank Hugh Montgomery for bringing paper [Hal44] to our 
attention. 
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